Dataset statistics
| Number of variables | 33 |
|---|---|
| Number of observations | 29986 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 35 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 7.8 MiB |
| Average record size in memory | 272.0 B |
Variable types
| Numeric | 28 |
|---|---|
| Categorical | 5 |
| Dataset has 35 (0.1%) duplicate rows | Duplicates |
SEX is highly correlated with SE_MA and 1 other fields | High correlation |
AGE is highly correlated with AgeBin | High correlation |
BILL_AMT1 is highly correlated with BILL_AMT2 | High correlation |
BILL_AMT2 is highly correlated with BILL_AMT1 and 1 other fields | High correlation |
BILL_AMT3 is highly correlated with BILL_AMT2 and 1 other fields | High correlation |
BILL_AMT4 is highly correlated with BILL_AMT3 and 2 other fields | High correlation |
BILL_AMT5 is highly correlated with BILL_AMT4 and 1 other fields | High correlation |
BILL_AMT6 is highly correlated with BILL_AMT4 and 1 other fields | High correlation |
SE_MA is highly correlated with SEX | High correlation |
AgeBin is highly correlated with AGE | High correlation |
SE_AG is highly correlated with SEX | High correlation |
Closeness_6 is highly correlated with Closeness_5 | High correlation |
Closeness_5 is highly correlated with Closeness_6 and 1 other fields | High correlation |
Closeness_4 is highly correlated with Closeness_5 | High correlation |
Closeness_3 is highly correlated with Closeness_2 | High correlation |
Closeness_2 is highly correlated with Closeness_3 and 1 other fields | High correlation |
Closeness_1 is highly correlated with Closeness_2 | High correlation |
PAY_AMT2 is highly skewed (γ1 = 30.46741983) | Skewed |
PAY_0 has 14733 (49.1%) zeros | Zeros |
PAY_2 has 15726 (52.4%) zeros | Zeros |
PAY_3 has 15762 (52.6%) zeros | Zeros |
PAY_4 has 16452 (54.9%) zeros | Zeros |
PAY_5 has 16944 (56.5%) zeros | Zeros |
PAY_6 has 16285 (54.3%) zeros | Zeros |
BILL_AMT1 has 2004 (6.7%) zeros | Zeros |
BILL_AMT2 has 2504 (8.4%) zeros | Zeros |
BILL_AMT3 has 2869 (9.6%) zeros | Zeros |
BILL_AMT4 has 3194 (10.7%) zeros | Zeros |
BILL_AMT5 has 3502 (11.7%) zeros | Zeros |
BILL_AMT6 has 4017 (13.4%) zeros | Zeros |
PAY_AMT1 has 5247 (17.5%) zeros | Zeros |
PAY_AMT2 has 5394 (18.0%) zeros | Zeros |
PAY_AMT3 has 5966 (19.9%) zeros | Zeros |
PAY_AMT4 has 6405 (21.4%) zeros | Zeros |
PAY_AMT5 has 6700 (22.3%) zeros | Zeros |
PAY_AMT6 has 7168 (23.9%) zeros | Zeros |
Reproduction
| Analysis started | 2021-03-05 11:09:25.972318 |
|---|---|
| Analysis finished | 2021-03-05 11:11:24.286302 |
| Duration | 1 minute and 58.31 seconds |
| Software version | pandas-profiling v2.12.0 |
| Download configuration | config.yaml |
LIMIT_BAL
Real number (ℝ≥0)
| Distinct | 81 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 167461.1379 |
| Minimum | 10000 |
|---|---|
| Maximum | 1000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | 10000 |
|---|---|
| 5-th percentile | 20000 |
| Q1 | 50000 |
| median | 140000 |
| Q3 | 240000 |
| 95-th percentile | 430000 |
| Maximum | 1000000 |
| Range | 990000 |
| Interquartile range (IQR) | 190000 |
Descriptive statistics
| Standard deviation | 129760.9827 |
|---|---|
| Coefficient of variation (CV) | 0.7748722145 |
| Kurtosis | 0.5368061035 |
| Mean | 167461.1379 |
| Median Absolute Deviation (MAD) | 90000 |
| Skewness | 0.9933272738 |
| Sum | 5021489680 |
| Variance | 1.683791264 × 1010 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 50000 | 3364 | 11.2% |
| 20000 | 1976 | 6.6% |
| 30000 | 1610 | 5.4% |
| 80000 | 1567 | 5.2% |
| 200000 | 1526 | 5.1% |
| 150000 | 1109 | 3.7% |
| 100000 | 1047 | 3.5% |
| 180000 | 995 | 3.3% |
| 360000 | 880 | 2.9% |
| 60000 | 825 | 2.8% |
| Other values (71) | 15087 |
| Value | Count | Frequency (%) |
| 10000 | 493 | 1.6% |
| 16000 | 2 | < 0.1% |
| 20000 | 1976 | |
| 30000 | 1610 | |
| 40000 | 230 | 0.8% |
| Value | Count | Frequency (%) |
| 1000000 | 1 | < 0.1% |
| 800000 | 2 | |
| 780000 | 2 | |
| 760000 | 1 | < 0.1% |
| 750000 | 4 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 468.5 KiB |
| 2 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 29986 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 2 | 18106 | |
| 1 | 11880 |
| Value | Count | Frequency (%) |
| 2 | 18106 | |
| 1 | 11880 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 18106 | |
| 1 | 11880 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 29986 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 18106 | |
| 1 | 11880 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 29986 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 18106 | |
| 1 | 11880 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29986 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 18106 | |
| 1 | 11880 |
EDUCATION
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 468.5 KiB |
| 2 | |
|---|---|
| 1 | |
| 3 | |
| 5 | 331 |
| 4 | 123 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 29986 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
| Value | Count | Frequency (%) |
| 2 | 14030 | |
| 1 | 10585 | |
| 3 | 4917 | 16.4% |
| 5 | 331 | 1.1% |
| 4 | 123 | 0.4% |
| Value | Count | Frequency (%) |
| 2 | 14030 | |
| 1 | 10585 | |
| 3 | 4917 | 16.4% |
| 5 | 331 | 1.1% |
| 4 | 123 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 14030 | |
| 1 | 10585 | |
| 3 | 4917 | 16.4% |
| 5 | 331 | 1.1% |
| 4 | 123 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 29986 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 14030 | |
| 1 | 10585 | |
| 3 | 4917 | 16.4% |
| 5 | 331 | 1.1% |
| 4 | 123 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 29986 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 14030 | |
| 1 | 10585 | |
| 3 | 4917 | 16.4% |
| 5 | 331 | 1.1% |
| 4 | 123 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29986 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 14030 | |
| 1 | 10585 | |
| 3 | 4917 | 16.4% |
| 5 | 331 | 1.1% |
| 4 | 123 | 0.4% |
MARRIAGE
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 468.5 KiB |
| 2 | |
|---|---|
| 1 | |
| 3 | 377 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 29986 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 2 | 15954 | |
| 1 | 13655 | |
| 3 | 377 | 1.3% |
| Value | Count | Frequency (%) |
| 2 | 15954 | |
| 1 | 13655 | |
| 3 | 377 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 15954 | |
| 1 | 13655 | |
| 3 | 377 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 29986 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 15954 | |
| 1 | 13655 | |
| 3 | 377 | 1.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 29986 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 15954 | |
| 1 | 13655 | |
| 3 | 377 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29986 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 15954 | |
| 1 | 13655 | |
| 3 | 377 | 1.3% |
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.48392583 |
| Minimum | 21 |
|---|---|
| Maximum | 79 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 28 |
| median | 34 |
| Q3 | 41 |
| 95-th percentile | 53 |
| Maximum | 79 |
| Range | 58 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 9.218534723 |
|---|---|
| Coefficient of variation (CV) | 0.2597946678 |
| Kurtosis | 0.04472938413 |
| Mean | 35.48392583 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.7325757746 |
| Sum | 1064021 |
| Variance | 84.98138243 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 29 | 1605 | 5.4% |
| 27 | 1477 | 4.9% |
| 28 | 1408 | 4.7% |
| 30 | 1393 | 4.6% |
| 26 | 1256 | 4.2% |
| 31 | 1217 | 4.1% |
| 25 | 1186 | 4.0% |
| 34 | 1162 | 3.9% |
| 32 | 1158 | 3.9% |
| 33 | 1146 | 3.8% |
| Other values (46) | 16978 |
| Value | Count | Frequency (%) |
| 21 | 67 | 0.2% |
| 22 | 560 | |
| 23 | 931 | |
| 24 | 1127 | |
| 25 | 1186 |
| Value | Count | Frequency (%) |
| 79 | 1 | < 0.1% |
| 75 | 3 | |
| 74 | 1 | < 0.1% |
| 73 | 4 | |
| 72 | 3 |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.0164743547 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 14733 |
| Zeros (%) | 49.1% |
| Negative | 8438 |
| Negative (%) | 28.1% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.123785345 |
|---|---|
| Coefficient of variation (CV) | -68.21422544 |
| Kurtosis | 2.722011751 |
| Mean | -0.0164743547 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7323112099 |
| Sum | -494 |
| Variance | 1.262893502 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 14733 | |
| -1 | 5682 | 18.9% |
| 1 | 3685 | 12.3% |
| -2 | 2756 | 9.2% |
| 2 | 2667 | 8.9% |
| 3 | 322 | 1.1% |
| 4 | 76 | 0.3% |
| 5 | 26 | 0.1% |
| 8 | 19 | 0.1% |
| 6 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 2756 | 9.2% |
| -1 | 5682 | 18.9% |
| 0 | 14733 | |
| 1 | 3685 | 12.3% |
| 2 | 2667 | 8.9% |
| Value | Count | Frequency (%) |
| 8 | 19 | 0.1% |
| 7 | 9 | < 0.1% |
| 6 | 11 | < 0.1% |
| 5 | 26 | 0.1% |
| 4 | 76 |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.1333622357 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 15726 |
| Zeros (%) | 52.4% |
| Negative | 9822 |
| Negative (%) | 32.8% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.19720764 |
|---|---|
| Coefficient of variation (CV) | -8.977111352 |
| Kurtosis | 1.570309426 |
| Mean | -0.1333622357 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.7904586596 |
| Sum | -3999 |
| Variance | 1.433306134 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 15726 | |
| -1 | 6044 | 20.2% |
| 2 | 3927 | 13.1% |
| -2 | 3778 | 12.6% |
| 3 | 326 | 1.1% |
| 4 | 99 | 0.3% |
| 1 | 28 | 0.1% |
| 5 | 25 | 0.1% |
| 7 | 20 | 0.1% |
| 6 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 3778 | 12.6% |
| -1 | 6044 | 20.2% |
| 0 | 15726 | |
| 1 | 28 | 0.1% |
| 2 | 3927 | 13.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7 | 20 | 0.1% |
| 6 | 12 | < 0.1% |
| 5 | 25 | 0.1% |
| 4 | 99 |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.1658440606 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 15762 |
| Zeros (%) | 52.6% |
| Negative | 10012 |
| Negative (%) | 33.4% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.19682557 |
|---|---|
| Coefficient of variation (CV) | -7.216571797 |
| Kurtosis | 2.085375266 |
| Mean | -0.1658440606 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.8406315527 |
| Sum | -4973 |
| Variance | 1.432391445 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 15762 | |
| -1 | 5931 | 19.8% |
| -2 | 4081 | 13.6% |
| 2 | 3818 | 12.7% |
| 3 | 240 | 0.8% |
| 4 | 76 | 0.3% |
| 7 | 27 | 0.1% |
| 6 | 23 | 0.1% |
| 5 | 21 | 0.1% |
| 1 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 4081 | 13.6% |
| -1 | 5931 | 19.8% |
| 0 | 15762 | |
| 1 | 4 | < 0.1% |
| 2 | 3818 | 12.7% |
| Value | Count | Frequency (%) |
| 8 | 3 | < 0.1% |
| 7 | 27 | 0.1% |
| 6 | 23 | 0.1% |
| 5 | 21 | 0.1% |
| 4 | 76 |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2203695058 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 16452 |
| Zeros (%) | 54.9% |
| Negative | 10025 |
| Negative (%) | 33.4% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.169106503 |
|---|---|
| Coefficient of variation (CV) | -5.305209987 |
| Kurtosis | 3.498525213 |
| Mean | -0.2203695058 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.9997163765 |
| Sum | -6608 |
| Variance | 1.366810015 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16452 | |
| -1 | 5681 | 18.9% |
| -2 | 4344 | 14.5% |
| 2 | 3158 | 10.5% |
| 3 | 180 | 0.6% |
| 4 | 69 | 0.2% |
| 7 | 58 | 0.2% |
| 5 | 35 | 0.1% |
| 6 | 5 | < 0.1% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 4344 | 14.5% |
| -1 | 5681 | 18.9% |
| 0 | 16452 | |
| 1 | 2 | < 0.1% |
| 2 | 3158 | 10.5% |
| Value | Count | Frequency (%) |
| 8 | 2 | < 0.1% |
| 7 | 58 | |
| 6 | 5 | < 0.1% |
| 5 | 35 | |
| 4 | 69 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2658240512 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 16944 |
| Zeros (%) | 56.5% |
| Negative | 10074 |
| Negative (%) | 33.6% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.133216354 |
|---|---|
| Coefficient of variation (CV) | -4.26303169 |
| Kurtosis | 3.990186627 |
| Mean | -0.2658240512 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.008135047 |
| Sum | -7971 |
| Variance | 1.284179306 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16944 | |
| -1 | 5532 | 18.4% |
| -2 | 4542 | 15.1% |
| 2 | 2626 | 8.8% |
| 3 | 178 | 0.6% |
| 4 | 84 | 0.3% |
| 7 | 58 | 0.2% |
| 5 | 17 | 0.1% |
| 6 | 4 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 4542 | 15.1% |
| -1 | 5532 | 18.4% |
| 0 | 16944 | |
| 2 | 2626 | 8.8% |
| 3 | 178 | 0.6% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7 | 58 | |
| 6 | 4 | < 0.1% |
| 5 | 17 | 0.1% |
| 4 | 84 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2906022811 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 16285 |
| Zeros (%) | 54.3% |
| Negative | 10622 |
| Negative (%) | 35.4% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.149949673 |
|---|---|
| Coefficient of variation (CV) | -3.957125417 |
| Kurtosis | 3.427731675 |
| Mean | -0.2906022811 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.9479783161 |
| Sum | -8714 |
| Variance | 1.32238425 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16285 | |
| -1 | 5733 | 19.1% |
| -2 | 4889 | 16.3% |
| 2 | 2766 | 9.2% |
| 3 | 184 | 0.6% |
| 4 | 49 | 0.2% |
| 7 | 46 | 0.2% |
| 6 | 19 | 0.1% |
| 5 | 13 | < 0.1% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 4889 | 16.3% |
| -1 | 5733 | 19.1% |
| 0 | 16285 | |
| 2 | 2766 | 9.2% |
| 3 | 184 | 0.6% |
| Value | Count | Frequency (%) |
| 8 | 2 | < 0.1% |
| 7 | 46 | |
| 6 | 19 | 0.1% |
| 5 | 13 | < 0.1% |
| 4 | 49 |
| Distinct | 22715 |
|---|---|
| Distinct (%) | 75.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51241.74595 |
| Minimum | -165580 |
|---|---|
| Maximum | 964511 |
| Zeros | 2004 |
| Zeros (%) | 6.7% |
| Negative | 590 |
| Negative (%) | 2.0% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -165580 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3564.25 |
| median | 22393.5 |
| Q3 | 67141.25 |
| 95-th percentile | 201217.75 |
| Maximum | 964511 |
| Range | 1130091 |
| Interquartile range (IQR) | 63577 |
Descriptive statistics
| Standard deviation | 73647.45693 |
|---|---|
| Coefficient of variation (CV) | 1.437255027 |
| Kurtosis | 9.801475321 |
| Mean | 51241.74595 |
| Median Absolute Deviation (MAD) | 21810.5 |
| Skewness | 2.663192136 |
| Sum | 1536534994 |
| Variance | 5423947912 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2004 | 6.7% |
| 390 | 244 | 0.8% |
| 780 | 76 | 0.3% |
| 326 | 72 | 0.2% |
| 316 | 63 | 0.2% |
| 2500 | 59 | 0.2% |
| 396 | 49 | 0.2% |
| 2400 | 39 | 0.1% |
| 416 | 29 | 0.1% |
| 500 | 25 | 0.1% |
| Other values (22705) | 27326 |
| Value | Count | Frequency (%) |
| -165580 | 1 | |
| -154973 | 1 | |
| -15308 | 1 | |
| -14386 | 1 | |
| -11545 | 1 |
| Value | Count | Frequency (%) |
| 964511 | 1 | |
| 746814 | 1 | |
| 653062 | 1 | |
| 630458 | 1 | |
| 626648 | 1 |
| Distinct | 22339 |
|---|---|
| Distinct (%) | 74.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49197.34079 |
| Minimum | -69777 |
|---|---|
| Maximum | 983931 |
| Zeros | 2504 |
| Zeros (%) | 8.4% |
| Negative | 669 |
| Negative (%) | 2.2% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -69777 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2986 |
| median | 21216 |
| Q3 | 64027.75 |
| 95-th percentile | 194795 |
| Maximum | 983931 |
| Range | 1053708 |
| Interquartile range (IQR) | 61041.75 |
Descriptive statistics
| Standard deviation | 71184.82114 |
|---|---|
| Coefficient of variation (CV) | 1.446924163 |
| Kurtosis | 10.29805535 |
| Mean | 49197.34079 |
| Median Absolute Deviation (MAD) | 20826 |
| Skewness | 2.704551617 |
| Sum | 1475231461 |
| Variance | 5067278760 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2504 | 8.4% |
| 390 | 231 | 0.8% |
| 326 | 75 | 0.3% |
| 780 | 75 | 0.3% |
| 316 | 72 | 0.2% |
| 2500 | 51 | 0.2% |
| 396 | 51 | 0.2% |
| 2400 | 42 | 0.1% |
| -200 | 29 | 0.1% |
| 416 | 28 | 0.1% |
| Other values (22329) | 26828 |
| Value | Count | Frequency (%) |
| -69777 | 1 | |
| -67526 | 1 | |
| -33350 | 1 | |
| -30000 | 1 | |
| -26214 | 1 |
| Value | Count | Frequency (%) |
| 983931 | 1 | |
| 743970 | 1 | |
| 671563 | 1 | |
| 646770 | 1 | |
| 624475 | 1 |
| Distinct | 22015 |
|---|---|
| Distinct (%) | 73.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47026.92403 |
| Minimum | -157264 |
|---|---|
| Maximum | 1664089 |
| Zeros | 2869 |
| Zeros (%) | 9.6% |
| Negative | 655 |
| Negative (%) | 2.2% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -157264 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2667.25 |
| median | 20091.5 |
| Q3 | 60174.5 |
| 95-th percentile | 187835.75 |
| Maximum | 1664089 |
| Range | 1821353 |
| Interquartile range (IQR) | 57507.25 |
Descriptive statistics
| Standard deviation | 69360.88417 |
|---|---|
| Coefficient of variation (CV) | 1.474918583 |
| Kurtosis | 19.77628324 |
| Mean | 47026.92403 |
| Median Absolute Deviation (MAD) | 19711.5 |
| Skewness | 3.087219116 |
| Sum | 1410149344 |
| Variance | 4810932253 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2869 | 9.6% |
| 390 | 275 | 0.9% |
| 780 | 74 | 0.2% |
| 326 | 63 | 0.2% |
| 316 | 62 | 0.2% |
| 396 | 48 | 0.2% |
| 2500 | 40 | 0.1% |
| 2400 | 39 | 0.1% |
| 416 | 29 | 0.1% |
| 200 | 26 | 0.1% |
| Other values (22005) | 26461 |
| Value | Count | Frequency (%) |
| -157264 | 1 | |
| -61506 | 1 | |
| -46127 | 1 | |
| -34041 | 1 | |
| -25443 | 1 |
| Value | Count | Frequency (%) |
| 1664089 | 1 | |
| 855086 | 1 | |
| 693131 | 1 | |
| 689643 | 1 | |
| 689627 | 1 |
| Distinct | 21540 |
|---|---|
| Distinct (%) | 71.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43276.91476 |
| Minimum | -170000 |
|---|---|
| Maximum | 891586 |
| Zeros | 3194 |
| Zeros (%) | 10.7% |
| Negative | 675 |
| Negative (%) | 2.3% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -170000 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2329.25 |
| median | 19056 |
| Q3 | 54559.5 |
| 95-th percentile | 174380.25 |
| Maximum | 891586 |
| Range | 1061586 |
| Interquartile range (IQR) | 52230.25 |
Descriptive statistics
| Standard deviation | 64343.80078 |
|---|---|
| Coefficient of variation (CV) | 1.486792696 |
| Kurtosis | 11.303771 |
| Mean | 43276.91476 |
| Median Absolute Deviation (MAD) | 18660 |
| Skewness | 2.8212682 |
| Sum | 1297701566 |
| Variance | 4140124699 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3194 | 10.7% |
| 390 | 246 | 0.8% |
| 780 | 101 | 0.3% |
| 316 | 68 | 0.2% |
| 326 | 62 | 0.2% |
| 396 | 44 | 0.1% |
| 2400 | 39 | 0.1% |
| 150 | 39 | 0.1% |
| 2500 | 34 | 0.1% |
| 416 | 33 | 0.1% |
| Other values (21530) | 26126 |
| Value | Count | Frequency (%) |
| -170000 | 1 | |
| -81334 | 1 | |
| -65167 | 1 | |
| -50616 | 1 | |
| -46627 | 1 |
| Value | Count | Frequency (%) |
| 891586 | 1 | |
| 706864 | 1 | |
| 628699 | 1 | |
| 616836 | 1 | |
| 572805 | 1 |
| Distinct | 21005 |
|---|---|
| Distinct (%) | 70.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40326.76256 |
| Minimum | -81334 |
|---|---|
| Maximum | 927171 |
| Zeros | 3502 |
| Zeros (%) | 11.7% |
| Negative | 655 |
| Negative (%) | 2.2% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -81334 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1765.75 |
| median | 18118.5 |
| Q3 | 50220.75 |
| 95-th percentile | 165798.5 |
| Maximum | 927171 |
| Range | 1008505 |
| Interquartile range (IQR) | 48455 |
Descriptive statistics
| Standard deviation | 60806.54835 |
|---|---|
| Coefficient of variation (CV) | 1.507846018 |
| Kurtosis | 12.30059999 |
| Mean | 40326.76256 |
| Median Absolute Deviation (MAD) | 17702.5 |
| Skewness | 2.875731101 |
| Sum | 1209238302 |
| Variance | 3697436322 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3502 | 11.7% |
| 390 | 235 | 0.8% |
| 780 | 94 | 0.3% |
| 316 | 79 | 0.3% |
| 326 | 62 | 0.2% |
| 150 | 58 | 0.2% |
| 396 | 47 | 0.2% |
| 2400 | 39 | 0.1% |
| 2500 | 37 | 0.1% |
| 416 | 36 | 0.1% |
| Other values (20995) | 25797 |
| Value | Count | Frequency (%) |
| -81334 | 1 | |
| -61372 | 1 | |
| -53007 | 1 | |
| -46627 | 1 | |
| -37594 | 1 |
| Value | Count | Frequency (%) |
| 927171 | 1 | |
| 823540 | 1 | |
| 587067 | 1 | |
| 551702 | 1 | |
| 547880 | 1 |
| Distinct | 20597 |
|---|---|
| Distinct (%) | 68.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38887.44718 |
| Minimum | -339603 |
|---|---|
| Maximum | 961664 |
| Zeros | 4017 |
| Zeros (%) | 13.4% |
| Negative | 688 |
| Negative (%) | 2.3% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -339603 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1257 |
| median | 17097.5 |
| Q3 | 49227.75 |
| 95-th percentile | 161912 |
| Maximum | 961664 |
| Range | 1301267 |
| Interquartile range (IQR) | 47970.75 |
Descriptive statistics
| Standard deviation | 59563.15405 |
|---|---|
| Coefficient of variation (CV) | 1.531680745 |
| Kurtosis | 12.26549564 |
| Mean | 38887.44718 |
| Median Absolute Deviation (MAD) | 16781.5 |
| Skewness | 2.845986675 |
| Sum | 1166078991 |
| Variance | 3547769321 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4017 | 13.4% |
| 390 | 207 | 0.7% |
| 780 | 86 | 0.3% |
| 150 | 78 | 0.3% |
| 316 | 77 | 0.3% |
| 326 | 56 | 0.2% |
| 396 | 45 | 0.2% |
| 416 | 36 | 0.1% |
| -18 | 33 | 0.1% |
| 2400 | 32 | 0.1% |
| Other values (20587) | 25319 |
| Value | Count | Frequency (%) |
| -339603 | 1 | |
| -209051 | 1 | |
| -150953 | 1 | |
| -94625 | 1 | |
| -73895 | 1 |
| Value | Count | Frequency (%) |
| 961664 | 1 | |
| 699944 | 1 | |
| 568638 | 1 | |
| 527711 | 1 | |
| 527566 | 1 |
| Distinct | 7939 |
|---|---|
| Distinct (%) | 26.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5663.448743 |
| Minimum | 0 |
|---|---|
| Maximum | 873552 |
| Zeros | 5247 |
| Zeros (%) | 17.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1000 |
| median | 2100 |
| Q3 | 5006 |
| 95-th percentile | 18425.25 |
| Maximum | 873552 |
| Range | 873552 |
| Interquartile range (IQR) | 4006 |
Descriptive statistics
| Standard deviation | 16566.50987 |
|---|---|
| Coefficient of variation (CV) | 2.92516285 |
| Kurtosis | 415.1242799 |
| Mean | 5663.448743 |
| Median Absolute Deviation (MAD) | 1931.5 |
| Skewness | 14.66660918 |
| Sum | 169824174 |
| Variance | 274449249.2 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5247 | 17.5% |
| 2000 | 1363 | 4.5% |
| 3000 | 891 | 3.0% |
| 5000 | 698 | 2.3% |
| 1500 | 507 | 1.7% |
| 4000 | 426 | 1.4% |
| 10000 | 401 | 1.3% |
| 1000 | 365 | 1.2% |
| 2500 | 298 | 1.0% |
| 6000 | 294 | 1.0% |
| Other values (7929) | 19496 |
| Value | Count | Frequency (%) |
| 0 | 5247 | |
| 1 | 9 | < 0.1% |
| 2 | 14 | < 0.1% |
| 3 | 15 | 0.1% |
| 4 | 18 | 0.1% |
| Value | Count | Frequency (%) |
| 873552 | 1 | |
| 505000 | 1 | |
| 493358 | 1 | |
| 423903 | 1 | |
| 405016 | 1 |
| Distinct | 7892 |
|---|---|
| Distinct (%) | 26.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5917.844061 |
| Minimum | 0 |
|---|---|
| Maximum | 1684259 |
| Zeros | 5394 |
| Zeros (%) | 18.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 833 |
| median | 2009 |
| Q3 | 5000 |
| 95-th percentile | 19001.75 |
| Maximum | 1684259 |
| Range | 1684259 |
| Interquartile range (IQR) | 4167 |
Descriptive statistics
| Standard deviation | 23040.83551 |
|---|---|
| Coefficient of variation (CV) | 3.893450939 |
| Kurtosis | 1642.424329 |
| Mean | 5917.844061 |
| Median Absolute Deviation (MAD) | 1991 |
| Skewness | 30.46741983 |
| Sum | 177452472 |
| Variance | 530880101.1 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5394 | 18.0% |
| 2000 | 1290 | 4.3% |
| 3000 | 857 | 2.9% |
| 5000 | 717 | 2.4% |
| 1000 | 594 | 2.0% |
| 1500 | 521 | 1.7% |
| 4000 | 410 | 1.4% |
| 10000 | 318 | 1.1% |
| 6000 | 283 | 0.9% |
| 2500 | 251 | 0.8% |
| Other values (7882) | 19351 |
| Value | Count | Frequency (%) |
| 0 | 5394 | |
| 1 | 15 | 0.1% |
| 2 | 20 | 0.1% |
| 3 | 18 | 0.1% |
| 4 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 1684259 | 1 | |
| 1227082 | 1 | |
| 1215471 | 1 | |
| 1024516 | 1 | |
| 580464 | 1 |
| Distinct | 7512 |
|---|---|
| Distinct (%) | 25.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5224.000967 |
| Minimum | 0 |
|---|---|
| Maximum | 896040 |
| Zeros | 5966 |
| Zeros (%) | 19.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 390 |
| median | 1800 |
| Q3 | 4503 |
| 95-th percentile | 17534.75 |
| Maximum | 896040 |
| Range | 896040 |
| Interquartile range (IQR) | 4113 |
Descriptive statistics
| Standard deviation | 17609.29387 |
|---|---|
| Coefficient of variation (CV) | 3.370844298 |
| Kurtosis | 564.2817568 |
| Mean | 5224.000967 |
| Median Absolute Deviation (MAD) | 1795 |
| Skewness | 17.21788413 |
| Sum | 156646893 |
| Variance | 310087230.7 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5966 | 19.9% |
| 2000 | 1285 | 4.3% |
| 1000 | 1102 | 3.7% |
| 3000 | 870 | 2.9% |
| 5000 | 721 | 2.4% |
| 1500 | 490 | 1.6% |
| 4000 | 381 | 1.3% |
| 10000 | 312 | 1.0% |
| 1200 | 243 | 0.8% |
| 6000 | 241 | 0.8% |
| Other values (7502) | 18375 |
| Value | Count | Frequency (%) |
| 0 | 5966 | |
| 1 | 13 | < 0.1% |
| 2 | 19 | 0.1% |
| 3 | 14 | < 0.1% |
| 4 | 15 | 0.1% |
| Value | Count | Frequency (%) |
| 896040 | 1 | |
| 889043 | 1 | |
| 508229 | 1 | |
| 417588 | 1 | |
| 400972 | 1 |
| Distinct | 6933 |
|---|---|
| Distinct (%) | 23.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4826.639699 |
| Minimum | 0 |
|---|---|
| Maximum | 621000 |
| Zeros | 6405 |
| Zeros (%) | 21.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 296 |
| median | 1500 |
| Q3 | 4013.75 |
| 95-th percentile | 16011.75 |
| Maximum | 621000 |
| Range | 621000 |
| Interquartile range (IQR) | 3717.75 |
Descriptive statistics
| Standard deviation | 15669.21268 |
|---|---|
| Coefficient of variation (CV) | 3.246401981 |
| Kurtosis | 277.2442185 |
| Mean | 4826.639699 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 12.90329242 |
| Sum | 144731618 |
| Variance | 245524226 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6405 | 21.4% |
| 1000 | 1394 | 4.6% |
| 2000 | 1214 | 4.0% |
| 3000 | 887 | 3.0% |
| 5000 | 810 | 2.7% |
| 1500 | 441 | 1.5% |
| 4000 | 402 | 1.3% |
| 10000 | 341 | 1.1% |
| 500 | 258 | 0.9% |
| 2500 | 258 | 0.9% |
| Other values (6923) | 17576 |
| Value | Count | Frequency (%) |
| 0 | 6405 | |
| 1 | 22 | 0.1% |
| 2 | 22 | 0.1% |
| 3 | 13 | < 0.1% |
| 4 | 20 | 0.1% |
| Value | Count | Frequency (%) |
| 621000 | 1 | |
| 528897 | 1 | |
| 497000 | 1 | |
| 432130 | 1 | |
| 400046 | 1 |
| Distinct | 6892 |
|---|---|
| Distinct (%) | 23.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4800.441706 |
| Minimum | 0 |
|---|---|
| Maximum | 426529 |
| Zeros | 6700 |
| Zeros (%) | 22.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 251.5 |
| median | 1500 |
| Q3 | 4031 |
| 95-th percentile | 16000 |
| Maximum | 426529 |
| Range | 426529 |
| Interquartile range (IQR) | 3779.5 |
Descriptive statistics
| Standard deviation | 15281.70818 |
|---|---|
| Coefficient of variation (CV) | 3.183396261 |
| Kurtosis | 179.9832984 |
| Mean | 4800.441706 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 11.12497653 |
| Sum | 143946045 |
| Variance | 233530604.9 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6700 | 22.3% |
| 1000 | 1340 | 4.5% |
| 2000 | 1323 | 4.4% |
| 3000 | 947 | 3.2% |
| 5000 | 814 | 2.7% |
| 1500 | 426 | 1.4% |
| 4000 | 401 | 1.3% |
| 10000 | 343 | 1.1% |
| 500 | 250 | 0.8% |
| 6000 | 247 | 0.8% |
| Other values (6882) | 17195 |
| Value | Count | Frequency (%) |
| 0 | 6700 | |
| 1 | 21 | 0.1% |
| 2 | 13 | < 0.1% |
| 3 | 13 | < 0.1% |
| 4 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 426529 | 1 | |
| 417990 | 1 | |
| 388071 | 1 | |
| 379267 | 1 | |
| 332000 | 1 |
| Distinct | 6936 |
|---|---|
| Distinct (%) | 23.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5216.533582 |
| Minimum | 0 |
|---|---|
| Maximum | 528666 |
| Zeros | 7168 |
| Zeros (%) | 23.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 118 |
| median | 1500 |
| Q3 | 4000 |
| 95-th percentile | 17369 |
| Maximum | 528666 |
| Range | 528666 |
| Interquartile range (IQR) | 3882 |
Descriptive statistics
| Standard deviation | 17781.32692 |
|---|---|
| Coefficient of variation (CV) | 3.408648029 |
| Kurtosis | 167.0906005 |
| Mean | 5216.533582 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 10.63858739 |
| Sum | 156422976 |
| Variance | 316175586.9 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7168 | |
| 1000 | 1299 | 4.3% |
| 2000 | 1295 | 4.3% |
| 3000 | 914 | 3.0% |
| 5000 | 808 | 2.7% |
| 1500 | 439 | 1.5% |
| 4000 | 411 | 1.4% |
| 10000 | 356 | 1.2% |
| 500 | 247 | 0.8% |
| 6000 | 220 | 0.7% |
| Other values (6926) | 16829 |
| Value | Count | Frequency (%) |
| 0 | 7168 | |
| 1 | 20 | 0.1% |
| 2 | 9 | < 0.1% |
| 3 | 14 | < 0.1% |
| 4 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 528666 | 1 | |
| 527143 | 1 | |
| 443001 | 1 | |
| 422000 | 1 | |
| 403500 | 1 |
Default
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 468.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 29986 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 23350 | |
| 1 | 6636 | 22.1% |
| Value | Count | Frequency (%) |
| 0 | 23350 | |
| 1 | 6636 | 22.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 23350 | |
| 1 | 6636 | 22.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 29986 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 23350 | |
| 1 | 6636 | 22.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 29986 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 23350 | |
| 1 | 6636 | 22.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29986 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 23350 | |
| 1 | 6636 | 22.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.368638698 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.543187062 |
|---|---|
| Coefficient of variation (CV) | 0.4581040593 |
| Kurtosis | -1.439433511 |
| Mean | 3.368638698 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.3497360266 |
| Sum | 101012 |
| Variance | 2.381426308 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 9407 | |
| 4 | 8467 | |
| 2 | 6547 | |
| 1 | 5188 | |
| 6 | 232 | 0.8% |
| 3 | 145 | 0.5% |
| Value | Count | Frequency (%) |
| 1 | 5188 | |
| 2 | 6547 | |
| 3 | 145 | 0.5% |
| 4 | 8467 | |
| 5 | 9407 |
| Value | Count | Frequency (%) |
| 6 | 232 | 0.8% |
| 5 | 9407 | |
| 4 | 8467 | |
| 3 | 145 | 0.5% |
| 2 | 6547 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 468.5 KiB |
| 2 | |
|---|---|
| 1 | |
| 3 | |
| 4 | |
| 5 | 339 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 29986 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 4 |
| Value | Count | Frequency (%) |
| 2 | 11231 | |
| 1 | 9617 | |
| 3 | 6459 | |
| 4 | 2340 | 7.8% |
| 5 | 339 | 1.1% |
| Value | Count | Frequency (%) |
| 2 | 11231 | |
| 1 | 9617 | |
| 3 | 6459 | |
| 4 | 2340 | 7.8% |
| 5 | 339 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 11231 | |
| 1 | 9617 | |
| 3 | 6459 | |
| 4 | 2340 | 7.8% |
| 5 | 339 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 29986 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 11231 | |
| 1 | 9617 | |
| 3 | 6459 | |
| 4 | 2340 | 7.8% |
| 5 | 339 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 29986 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 11231 | |
| 1 | 9617 | |
| 3 | 6459 | |
| 4 | 2340 | 7.8% |
| 5 | 339 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29986 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 11231 | |
| 1 | 9617 | |
| 3 | 6459 | |
| 4 | 2340 | 7.8% |
| 5 | 339 | 1.1% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.103748416 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 6 |
| Q3 | 7 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.553899468 |
|---|---|
| Coefficient of variation (CV) | 0.5003968181 |
| Kurtosis | -1.323168512 |
| Mean | 5.103748416 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.3163160542 |
| Sum | 153041 |
| Variance | 6.522402491 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 6670 | |
| 6 | 6337 | |
| 2 | 4561 | |
| 8 | 3691 | |
| 1 | 3280 | |
| 3 | 2768 | |
| 9 | 1248 | 4.2% |
| 4 | 1092 | 3.6% |
| 5 | 179 | 0.6% |
| 10 | 160 | 0.5% |
| Value | Count | Frequency (%) |
| 1 | 3280 | |
| 2 | 4561 | |
| 3 | 2768 | |
| 4 | 1092 | 3.6% |
| 5 | 179 | 0.6% |
| Value | Count | Frequency (%) |
| 10 | 160 | 0.5% |
| 9 | 1248 | 4.2% |
| 8 | 3691 | |
| 7 | 6670 | |
| 6 | 6337 |
| Distinct | 23644 |
|---|---|
| Distinct (%) | 78.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6812992165 |
| Minimum | -2.88555 |
|---|---|
| Maximum | 2.50953 |
| Zeros | 28 |
| Zeros (%) | 0.1% |
| Negative | 798 |
| Negative (%) | 2.7% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -2.88555 |
|---|---|
| 5-th percentile | 0.026124375 |
| Q1 | 0.4177436905 |
| median | 0.81459 |
| Q3 | 0.99219875 |
| 95-th percentile | 1 |
| Maximum | 2.50953 |
| Range | 5.39508 |
| Interquartile range (IQR) | 0.5744550595 |
Descriptive statistics
| Standard deviation | 0.3453089827 |
|---|---|
| Coefficient of variation (CV) | 0.5068389546 |
| Kurtosis | 0.1707189498 |
| Mean | 0.6812992165 |
| Median Absolute Deviation (MAD) | 0.18541 |
| Skewness | -0.8546532032 |
| Sum | 20429.43831 |
| Variance | 0.1192382935 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 4017 | 13.4% |
| 0.9922 | 42 | 0.1% |
| 0.997 | 31 | 0.1% |
| 0.961 | 31 | 0.1% |
| 0 | 28 | 0.1% |
| 0.995125 | 26 | 0.1% |
| 0.99805 | 19 | 0.1% |
| 0.99 | 16 | 0.1% |
| 0.9805 | 16 | 0.1% |
| 0.987 | 15 | 0.1% |
| Other values (23634) | 25745 |
| Value | Count | Frequency (%) |
| -2.88555 | 1 | |
| -1.6941 | 1 | |
| -1.442066667 | 1 | |
| -1.41605 | 1 | |
| -1.19405 | 1 |
| Value | Count | Frequency (%) |
| 2.50953 | 1 | |
| 2.212867857 | 1 | |
| 1.75 | 1 | |
| 1.720865517 | 1 | |
| 1.5805 | 1 |
| Distinct | 24065 |
|---|---|
| Distinct (%) | 80.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6667713841 |
| Minimum | -3.9355 |
|---|---|
| Maximum | 1.876742857 |
| Zeros | 20 |
| Zeros (%) | 0.1% |
| Negative | 820 |
| Negative (%) | 2.7% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -3.9355 |
|---|---|
| 5-th percentile | 0.0257 |
| Q1 | 0.39750625 |
| median | 0.7877099379 |
| Q3 | 0.9888571429 |
| 95-th percentile | 1 |
| Maximum | 1.876742857 |
| Range | 5.812242857 |
| Interquartile range (IQR) | 0.5913508929 |
Descriptive statistics
| Standard deviation | 0.3505512357 |
|---|---|
| Coefficient of variation (CV) | 0.5257442717 |
| Kurtosis | 1.807782789 |
| Mean | 0.6667713841 |
| Median Absolute Deviation (MAD) | 0.2117170318 |
| Skewness | -0.9281182575 |
| Sum | 19993.80672 |
| Variance | 0.1228861689 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3502 | 11.7% |
| 0.9922 | 39 | 0.1% |
| 0.9805 | 36 | 0.1% |
| 0.961 | 34 | 0.1% |
| 0.995125 | 30 | 0.1% |
| 0.997 | 23 | 0.1% |
| 0 | 20 | 0.1% |
| 0.987 | 20 | 0.1% |
| 0.9961 | 19 | 0.1% |
| 0.974 | 16 | 0.1% |
| Other values (24055) | 26247 |
| Value | Count | Frequency (%) |
| -3.9355 | 1 | |
| -3.92625 | 1 | |
| -2.498716667 | 1 | |
| -1.73208 | 1 | |
| -1.5335 | 1 |
| Value | Count | Frequency (%) |
| 1.876742857 | 1 | |
| 1.7653 | 1 | |
| 1.75 | 1 | |
| 1.60962 | 1 | |
| 1.47225 | 1 |
| Distinct | 24440 |
|---|---|
| Distinct (%) | 81.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.640378923 |
| Minimum | -4.14685 |
|---|---|
| Maximum | 2.3745 |
| Zeros | 14 |
| Zeros (%) | < 0.1% |
| Negative | 1018 |
| Negative (%) | 3.4% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -4.14685 |
|---|---|
| 5-th percentile | 0.0137075 |
| Q1 | 0.3319189951 |
| median | 0.757585 |
| Q3 | 0.985672 |
| 95-th percentile | 1 |
| Maximum | 2.3745 |
| Range | 6.52135 |
| Interquartile range (IQR) | 0.6537530049 |
Descriptive statistics
| Standard deviation | 0.3686953778 |
|---|---|
| Coefficient of variation (CV) | 0.5757456477 |
| Kurtosis | 1.3571981 |
| Mean | 0.640378923 |
| Median Absolute Deviation (MAD) | 0.2407091176 |
| Skewness | -0.8346489936 |
| Sum | 19202.40239 |
| Variance | 0.1359362816 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3194 | 10.7% |
| 0.961 | 47 | 0.2% |
| 0.9922 | 45 | 0.2% |
| 0.9805 | 36 | 0.1% |
| 0.995125 | 27 | 0.1% |
| 0.987 | 20 | 0.1% |
| 0.997 | 20 | 0.1% |
| 0.99805 | 18 | 0.1% |
| 0.9961 | 16 | 0.1% |
| 0.9875 | 15 | 0.1% |
| Other values (24430) | 26548 |
| Value | Count | Frequency (%) |
| -4.14685 | 1 | |
| -3.6455 | 1 | |
| -2.71595 | 1 | |
| -2.170033333 | 1 | |
| -1.79824 | 1 |
| Value | Count | Frequency (%) |
| 2.3745 | 1 | |
| 2.0433 | 1 | |
| 1.8758 | 1 | |
| 1.75 | 1 | |
| 1.653846154 | 1 |
| Distinct | 24726 |
|---|---|
| Distinct (%) | 82.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6076811577 |
| Minimum | -9.688575 |
|---|---|
| Maximum | 2.0251 |
| Zeros | 5 |
| Zeros (%) | < 0.1% |
| Negative | 1583 |
| Negative (%) | 5.3% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -9.688575 |
|---|---|
| 5-th percentile | -0.00171 |
| Q1 | 0.2445238871 |
| median | 0.7264225 |
| Q3 | 0.98395325 |
| 95-th percentile | 1 |
| Maximum | 2.0251 |
| Range | 11.713675 |
| Interquartile range (IQR) | 0.7394293629 |
Descriptive statistics
| Standard deviation | 0.3964647048 |
|---|---|
| Coefficient of variation (CV) | 0.6524222444 |
| Kurtosis | 16.25047825 |
| Mean | 0.6076811577 |
| Median Absolute Deviation (MAD) | 0.2716325 |
| Skewness | -1.326840115 |
| Sum | 18221.9272 |
| Variance | 0.1571842621 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2869 | 9.6% |
| 0.9922 | 41 | 0.1% |
| 0.9805 | 38 | 0.1% |
| 0.961 | 31 | 0.1% |
| 0.995125 | 30 | 0.1% |
| 0.987 | 30 | 0.1% |
| 0.997 | 21 | 0.1% |
| 0.9961 | 20 | 0.1% |
| 0.9875 | 19 | 0.1% |
| 0.99805 | 16 | 0.1% |
| Other values (24716) | 26871 |
| Value | Count | Frequency (%) |
| -9.688575 | 1 | |
| -4.3914 | 1 | |
| -3.55805 | 1 | |
| -2.8632 | 1 | |
| -2.47605 | 1 |
| Value | Count | Frequency (%) |
| 2.0251 | 1 | |
| 1.925082353 | 1 | |
| 1.75 | 1 | |
| 1.1872 | 1 | |
| 1.16962 | 1 |
| Distinct | 25076 |
|---|---|
| Distinct (%) | 83.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5887250858 |
| Minimum | -5.3805 |
|---|---|
| Maximum | 2.39554 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 1940 |
| Negative (%) | 6.5% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -5.3805 |
|---|---|
| 5-th percentile | -0.01036809955 |
| Q1 | 0.1933329545 |
| median | 0.7035604545 |
| Q3 | 0.981665 |
| 95-th percentile | 1 |
| Maximum | 2.39554 |
| Range | 7.77604 |
| Interquartile range (IQR) | 0.7883320455 |
Descriptive statistics
| Standard deviation | 0.4045605353 |
|---|---|
| Coefficient of variation (CV) | 0.6871807319 |
| Kurtosis | 2.673035351 |
| Mean | 0.5887250858 |
| Median Absolute Deviation (MAD) | 0.2937837677 |
| Skewness | -0.8175524053 |
| Sum | 17653.51042 |
| Variance | 0.1636692267 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2504 | 8.4% |
| 0.9922 | 35 | 0.1% |
| 0.987 | 29 | 0.1% |
| 0.9805 | 29 | 0.1% |
| 0.995125 | 28 | 0.1% |
| 0.961 | 24 | 0.1% |
| 0.997 | 19 | 0.1% |
| 0.9961 | 15 | 0.1% |
| 0.9930555556 | 14 | < 0.1% |
| 0.9875 | 14 | < 0.1% |
| Other values (25066) | 27275 |
| Value | Count | Frequency (%) |
| -5.3805 | 1 | |
| -4.4562 | 1 | |
| -3.2676 | 1 | |
| -2.7631 | 1 | |
| -2.7566 | 1 |
| Value | Count | Frequency (%) |
| 2.39554 | 1 | |
| 1.964657143 | 1 | |
| 1.2989 | 1 | |
| 1.271733333 | 1 | |
| 1.254428571 | 1 |
| Distinct | 25557 |
|---|---|
| Distinct (%) | 85.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.576078443 |
| Minimum | -5.4553 |
|---|---|
| Maximum | 1.619892 |
| Zeros | 8 |
| Zeros (%) | < 0.1% |
| Negative | 2115 |
| Negative (%) | 7.1% |
| Memory size | 468.5 KiB |
Quantile statistics
| Minimum | -5.4553 |
|---|---|
| 5-th percentile | -0.01313026316 |
| Q1 | 0.1700104167 |
| median | 0.685640724 |
| Q3 | 0.9779548913 |
| 95-th percentile | 1 |
| Maximum | 1.619892 |
| Range | 7.075192 |
| Interquartile range (IQR) | 0.8079444746 |
Descriptive statistics
| Standard deviation | 0.4114713624 |
|---|---|
| Coefficient of variation (CV) | 0.7142627317 |
| Kurtosis | 2.618953272 |
| Mean | 0.576078443 |
| Median Absolute Deviation (MAD) | 0.3096397522 |
| Skewness | -0.8186595073 |
| Sum | 17274.28819 |
| Variance | 0.1693086821 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2004 | 6.7% |
| 0.9922 | 37 | 0.1% |
| 0.9805 | 36 | 0.1% |
| 0.995125 | 26 | 0.1% |
| 0.9961 | 23 | 0.1% |
| 0.987 | 23 | 0.1% |
| 0.9875 | 21 | 0.1% |
| 0.997 | 19 | 0.1% |
| 0.961 | 17 | 0.1% |
| 0.9930555556 | 16 | 0.1% |
| Other values (25547) | 27764 |
| Value | Count | Frequency (%) |
| -5.4553 | 1 | |
| -4.3095 | 1 | |
| -3.1406 | 1 | |
| -2.9529 | 1 | |
| -2.7255 | 1 |
| Value | Count | Frequency (%) |
| 1.619892 | 1 | |
| 1.551933333 | 1 | |
| 1.2309 | 1 | |
| 1.2 | 1 | |
| 1.19604 | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| LIMIT_BAL | SEX | EDUCATION | MARRIAGE | AGE | PAY_0 | PAY_2 | PAY_3 | PAY_4 | PAY_5 | PAY_6 | BILL_AMT1 | BILL_AMT2 | BILL_AMT3 | BILL_AMT4 | BILL_AMT5 | BILL_AMT6 | PAY_AMT1 | PAY_AMT2 | PAY_AMT3 | PAY_AMT4 | PAY_AMT5 | PAY_AMT6 | Default | SE_MA | AgeBin | SE_AG | Closeness_6 | Closeness_5 | Closeness_4 | Closeness_3 | Closeness_2 | Closeness_1 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 20000 | 2 | 2 | 1 | 24 | 2 | 2 | -1 | -1 | -2 | -2 | 3913 | 3102 | 689 | 0 | 0 | 0 | 0 | 689 | 0 | 0 | 0 | 0 | 1 | 4 | 1 | 6 | 1.00 | 1.00 | 1.00 | 0.97 | 0.84 | 0.80 |
| 1 | 120000 | 2 | 2 | 2 | 26 | -1 | 2 | 0 | 0 | 0 | 2 | 2682 | 1725 | 2682 | 3272 | 3455 | 3261 | 0 | 1000 | 1000 | 1000 | 0 | 2000 | 1 | 5 | 1 | 6 | 0.97 | 0.97 | 0.97 | 0.98 | 0.99 | 0.98 |
| 2 | 90000 | 2 | 2 | 2 | 34 | 0 | 0 | 0 | 0 | 0 | 0 | 29239 | 14027 | 13559 | 14331 | 14948 | 15549 | 1518 | 1500 | 1000 | 1000 | 1000 | 5000 | 0 | 5 | 2 | 7 | 0.83 | 0.83 | 0.84 | 0.85 | 0.84 | 0.68 |
| 3 | 50000 | 2 | 2 | 1 | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 46990 | 48233 | 49291 | 28314 | 28959 | 29547 | 2000 | 2019 | 1200 | 1100 | 1069 | 1000 | 0 | 4 | 2 | 7 | 0.41 | 0.42 | 0.43 | 0.01 | 0.04 | 0.06 |
| 4 | 50000 | 1 | 2 | 1 | 57 | -1 | 0 | -1 | 0 | 0 | 0 | 8617 | 5670 | 35835 | 20940 | 19146 | 19131 | 2000 | 36681 | 10000 | 9000 | 689 | 679 | 0 | 1 | 4 | 4 | 0.62 | 0.62 | 0.58 | 0.28 | 0.89 | 0.83 |
| 5 | 50000 | 1 | 1 | 2 | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 64400 | 57069 | 57608 | 19394 | 19619 | 20024 | 2500 | 1815 | 657 | 1000 | 1000 | 800 | 0 | 2 | 2 | 2 | 0.60 | 0.61 | 0.61 | -0.15 | -0.14 | -0.29 |
| 6 | 500000 | 1 | 1 | 2 | 29 | 0 | 0 | 0 | 0 | 0 | 0 | 367965 | 412023 | 445007 | 542653 | 483003 | 473944 | 55000 | 40000 | 38000 | 20239 | 13750 | 13770 | 0 | 2 | 1 | 1 | 0.05 | 0.03 | -0.09 | 0.11 | 0.18 | 0.26 |
| 7 | 100000 | 2 | 2 | 2 | 23 | 0 | -1 | -1 | 0 | 0 | -1 | 11876 | 380 | 601 | 221 | -159 | 567 | 380 | 601 | 0 | 581 | 1687 | 1542 | 0 | 5 | 1 | 6 | 0.99 | 1.00 | 1.00 | 0.99 | 1.00 | 0.88 |
| 8 | 140000 | 2 | 3 | 1 | 28 | 0 | 0 | 2 | 0 | 0 | 0 | 11285 | 14096 | 12108 | 12211 | 11793 | 3719 | 3329 | 0 | 432 | 1000 | 1000 | 1000 | 0 | 4 | 1 | 6 | 0.97 | 0.92 | 0.91 | 0.91 | 0.90 | 0.92 |
| 9 | 20000 | 1 | 3 | 2 | 35 | -2 | -2 | -2 | -2 | -1 | -1 | 0 | 0 | 0 | 0 | 13007 | 13912 | 0 | 0 | 0 | 13007 | 1122 | 0 | 0 | 2 | 2 | 2 | 0.30 | 0.35 | 1.00 | 1.00 | 1.00 | 1.00 |
Last rows
| LIMIT_BAL | SEX | EDUCATION | MARRIAGE | AGE | PAY_0 | PAY_2 | PAY_3 | PAY_4 | PAY_5 | PAY_6 | BILL_AMT1 | BILL_AMT2 | BILL_AMT3 | BILL_AMT4 | BILL_AMT5 | BILL_AMT6 | PAY_AMT1 | PAY_AMT2 | PAY_AMT3 | PAY_AMT4 | PAY_AMT5 | PAY_AMT6 | Default | SE_MA | AgeBin | SE_AG | Closeness_6 | Closeness_5 | Closeness_4 | Closeness_3 | Closeness_2 | Closeness_1 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 29990 | 140000 | 1 | 2 | 1 | 41 | 0 | 0 | 0 | 0 | 0 | 0 | 138325 | 137142 | 139110 | 138262 | 49675 | 46121 | 6000 | 7000 | 4228 | 1505 | 2000 | 2000 | 0 | 1 | 3 | 3 | 0.67 | 0.65 | 0.01 | 6.36e-03 | 0.02 | 0.01 |
| 29991 | 210000 | 1 | 2 | 1 | 34 | 3 | 2 | 2 | 2 | 2 | 2 | 2500 | 2500 | 2500 | 2500 | 2500 | 2500 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 2 | 2 | 0.99 | 0.99 | 0.99 | 9.88e-01 | 0.99 | 0.99 |
| 29992 | 10000 | 1 | 3 | 1 | 43 | 0 | 0 | 0 | -2 | -2 | -2 | 8802 | 10400 | 0 | 0 | 0 | 0 | 2000 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 3 | 3 | 1.00 | 1.00 | 1.00 | 1.00e+00 | -0.04 | 0.12 |
| 29993 | 100000 | 1 | 1 | 2 | 38 | 0 | -1 | -1 | 0 | 0 | 0 | 3042 | 1427 | 102996 | 70626 | 69473 | 55004 | 2000 | 111784 | 4000 | 3000 | 2000 | 2000 | 0 | 2 | 2 | 2 | 0.45 | 0.31 | 0.29 | -3.00e-02 | 0.99 | 0.97 |
| 29994 | 80000 | 1 | 2 | 2 | 34 | 2 | 2 | 2 | 2 | 2 | 2 | 72557 | 77708 | 79384 | 77519 | 82607 | 81158 | 7000 | 3500 | 0 | 7000 | 0 | 4000 | 1 | 2 | 2 | 2 | -0.01 | -0.03 | 0.03 | 7.70e-03 | 0.03 | 0.09 |
| 29995 | 220000 | 1 | 3 | 1 | 39 | 0 | 0 | 0 | 0 | 0 | 0 | 188948 | 192815 | 208365 | 88004 | 31237 | 15980 | 8500 | 20000 | 5003 | 3047 | 5000 | 1000 | 0 | 1 | 2 | 2 | 0.93 | 0.86 | 0.60 | 5.29e-02 | 0.12 | 0.14 |
| 29996 | 150000 | 1 | 3 | 2 | 43 | -1 | -1 | -1 | -1 | 0 | 0 | 1683 | 1828 | 3502 | 8979 | 5190 | 0 | 1837 | 3526 | 8998 | 129 | 0 | 0 | 0 | 2 | 3 | 3 | 1.00 | 0.97 | 0.94 | 9.77e-01 | 0.99 | 0.99 |
| 29997 | 30000 | 1 | 2 | 2 | 37 | 4 | 3 | 2 | -1 | 0 | 0 | 3565 | 3356 | 2758 | 20878 | 20582 | 19357 | 0 | 0 | 22000 | 4200 | 2000 | 3100 | 1 | 2 | 2 | 2 | 0.35 | 0.31 | 0.30 | 9.08e-01 | 0.89 | 0.88 |
| 29998 | 80000 | 1 | 3 | 1 | 41 | 1 | -1 | 0 | 0 | 0 | -1 | -1645 | 78379 | 76304 | 52774 | 11855 | 48944 | 85900 | 3409 | 1178 | 1926 | 52964 | 1804 | 1 | 1 | 3 | 3 | 0.39 | 0.85 | 0.34 | 4.62e-02 | 0.02 | 1.02 |
| 29999 | 50000 | 1 | 2 | 1 | 46 | 0 | 0 | 0 | 0 | 0 | 0 | 47929 | 48905 | 49764 | 36535 | 32428 | 15313 | 2078 | 1800 | 1430 | 1000 | 1000 | 1000 | 1 | 1 | 3 | 3 | 0.69 | 0.35 | 0.27 | 4.72e-03 | 0.02 | 0.04 |
Most frequent
| LIMIT_BAL | SEX | EDUCATION | MARRIAGE | AGE | PAY_0 | PAY_2 | PAY_3 | PAY_4 | PAY_5 | PAY_6 | BILL_AMT1 | BILL_AMT2 | BILL_AMT3 | BILL_AMT4 | BILL_AMT5 | BILL_AMT6 | PAY_AMT1 | PAY_AMT2 | PAY_AMT3 | PAY_AMT4 | PAY_AMT5 | PAY_AMT6 | Default | SE_MA | AgeBin | SE_AG | Closeness_6 | Closeness_5 | Closeness_4 | Closeness_3 | Closeness_2 | Closeness_1 | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 20000 | 1 | 2 | 2 | 24 | 2 | 2 | 4 | 4 | 4 | 4 | 1650 | 1650 | 1650 | 1650 | 1650 | 1650 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 1 | 1 | 0.92 | 0.92 | 0.92 | 0.92 | 0.92 | 0.92 | 2 |
| 1 | 50000 | 1 | 2 | 2 | 26 | 1 | -2 | -2 | -2 | -2 | -2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 1 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 2 |
| 2 | 50000 | 2 | 1 | 2 | 23 | 1 | -2 | -2 | -2 | -2 | -2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 1 | 6 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 2 |
| 3 | 80000 | 2 | 2 | 1 | 31 | -2 | -2 | -2 | -2 | -2 | -2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 2 | 7 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 2 |
| 4 | 80000 | 2 | 2 | 2 | 25 | -2 | -2 | -2 | -2 | -2 | -2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 1 | 6 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 2 |
| 5 | 80000 | 2 | 3 | 1 | 42 | -2 | -2 | -2 | -2 | -2 | -2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 3 | 8 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 2 |
| 6 | 90000 | 2 | 1 | 2 | 31 | 1 | -2 | -2 | -2 | -2 | -2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 2 | 7 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 2 |
| 7 | 100000 | 2 | 2 | 1 | 49 | 1 | -2 | -2 | -2 | -2 | -2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 3 | 8 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 2 |
| 8 | 110000 | 2 | 1 | 2 | 31 | 1 | -2 | -2 | -2 | -2 | -2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 2 | 7 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 2 |
| 9 | 140000 | 1 | 1 | 2 | 29 | 1 | -2 | -2 | -2 | -2 | -2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 1 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 2 |